A simulation study comparing likelihood and non-likelihood approaches in analyzing overdispersed count data
Abstract
Overdispersed count data are modelled with likelihood and non-likelihood approaches. Likelihood approaches include Poisson mixtures with three distributions: the gamma, the lognormal, and the inverse Gaussian. Non-likelihood approaches include the robust sandwich estimator and quasi-likelihood. In this simulation study, overdispersed count data were simulated under Poisson mixtures with the gamma, the lognormal, and the inverse Gaussian distributions, then analyzed with the five likelihood and non-likelihood approaches. Our results indicated that 1) when the count data are mildly overdispersed, there are virtually no differences in type I error rate, standard error of the main effect, and empirical power among the five methods; 2) when the count data are very overdispersed, none of these five approaches is robust to model misspecification as evaluated by type I error rate, standard error of the main effect, and empirical power. This simulation study urges caution in using non-likelihood methods for analyzing very overdispersed count data because of likely higher type I error and inappropriate power levels. Unlike non-likelihood approaches, likelihood approaches allow for statistical tests based on likelihood ratios and for checking model fit, and they provide a basis for power and sample size calculations. When likelihood approaches are used, we suggest comparing likelihood values to select the appropriate parametric method for analyzing very overdispersed count data. AMS 2000 subject classifications: Primary 60K35, 60K35; secondary 60K35.
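The data-generating step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' simulation code: the abstract gives no parameter values, so the mean `mu`, the mixing variance `phi`, the sample size, and all function names below are assumptions. Each mixing distribution is scaled to have mean 1 and variance `phi`, so the resulting counts have mean `mu` and variance `mu + phi * mu**2` under all three mixtures.

```python
import math
import random
import statistics


def poisson_draw(rng, lam):
    """One Poisson(lam) draw via Knuth's multiplication method (moderate lam only)."""
    threshold = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k


def mixing_draw(rng, kind, phi):
    """One draw from a mixing distribution with mean 1 and variance phi (assumed scaling)."""
    if kind == "gamma":
        # shape 1/phi and scale phi give mean 1, variance phi
        return rng.gammavariate(1.0 / phi, phi)
    if kind == "lognormal":
        # sigma^2 = log(1 + phi) and location -sigma^2/2 give mean 1, variance phi
        sigma2 = math.log1p(phi)
        return rng.lognormvariate(-sigma2 / 2.0, math.sqrt(sigma2))
    if kind == "inverse_gaussian":
        # Michael-Schucany-Haas transformation with mean 1 and shape 1/phi
        shape = 1.0 / phi
        y = rng.gauss(0.0, 1.0) ** 2
        x = 1.0 + y / (2.0 * shape) - math.sqrt(4.0 * shape * y + y * y) / (2.0 * shape)
        return x if rng.random() <= 1.0 / (1.0 + x) else 1.0 / x
    raise ValueError(f"unknown mixing distribution: {kind}")


def simulate_counts(kind, mu, phi, n, seed=None):
    """Simulate n counts from a Poisson mixture with marginal mean mu."""
    rng = random.Random(seed)
    return [poisson_draw(rng, mu * mixing_draw(rng, kind, phi)) for _ in range(n)]


def dispersion_index(counts):
    """Sample variance divided by sample mean; close to 1 for plain Poisson data."""
    return statistics.variance(counts) / statistics.mean(counts)


if __name__ == "__main__":
    for kind in ("gamma", "lognormal", "inverse_gaussian"):
        counts = simulate_counts(kind, mu=5.0, phi=0.5, n=5000, seed=42)
        print(kind, round(statistics.mean(counts), 2), round(dispersion_index(counts), 2))
```

A dispersion index well above 1 signals overdispersion relative to the Poisson model. Because the three mixtures share the same first two moments in this sketch, they differ only in higher moments and tail behaviour, which is one reason a moment-based summary alone may not distinguish them.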
Related resources
Beta-Binomial and Ordinal Joint Model with Random Effects for Analyzing Mixed Longitudinal Responses
The analysis of discrete mixed responses is an important statistical issue in various sciences. Ordinal and overdispersed binomial variables are discrete. Overdispersed binomial data are a sum of correlated Bernoulli experiments with equal success probabilities. In this paper, a joint model with random effects is proposed for analyzing mixed overdispersed binomial and ordinal longitudinal respo...
The Development of Maximum Likelihood Estimation Approaches for Adaptive Estimation of Free Speed and Critical Density in Vehicle Freeways
The performance of many traffic control strategies depends on how accurately the traffic flow models have been calibrated. One of the most applicable traffic flow models in traffic control and management is the LWR or METANET model. In practice, key parameters of the LWR model, including free flow speed and critical density, are parameterized using flow and speed measurements gathered by inductive ...
Modified Maximum Likelihood Estimation in First-Order Autoregressive Moving Average Models with some Non-Normal Residuals
When modeling time series data using autoregressive-moving average processes, it is common practice to presume that the residuals are normally distributed. However, sometimes we encounter non-normal residuals and asymmetry in the marginal distribution of the data. Despite widespread use of pure autoregressive processes for modeling non-normal time series, the autoregressive-moving average models have le...
Modified signed log-likelihood test for the coefficient of variation of an inverse Gaussian population
In this paper, we consider the problem of two-sided hypothesis testing for the coefficient of variation of an inverse Gaussian population. The approach used here is the modified signed log-likelihood ratio (MSLR) method, which is a modification of the traditional signed log-likelihood ratio test. Previous works show that this proposed method has third-order accuracy whereas the traditi...
On the EM algorithm for overdispersed count data.
In this paper, we consider the use of the EM algorithm for the fitting of distributions by maximum likelihood to overdispersed count data. In the course of this, we also provide a review of various approaches that have been proposed for the analysis of such data. As the Poisson and binomial regression models, which are often adopted in the first instance for these analyses, are particular examp...